Text Detection and Extraction from Document Images using K-Nearest Neighbor Rule

نویسنده

  • Mallikka Rajalingam
چکیده

Text extraction and text line detection is the foundation of document image analysis. Since many years, a large number of text detection methods have been proposed, where these methods depend on convinced assumptions of documents with various font style, font size, distorted text, uneven lighting, complex background and low resolution. In this paper, reveals k-nearest neighbor rule as a generic text-line detection and text extraction approach that can be applied on a complex mail document images. The performance evaluation of transition map generation and it compares with other two models is presented in this paper. Experimental analysis shows that image based text Optical Character Recognition (OCR) method is to extract the text from the colorful image and detection of advertised mails is very efficient than that of the other existing methods. KeywordsText extraction, Text-line detection, KNN rule, and mail document image.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification

The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...

متن کامل

An Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification

The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...

متن کامل

Extraction of Suitable Features for Breast Cancer Detection Using Dynamic Analysis of Thermographic Images

Introduction: Thermography is a non-invasive imaging technique that can be used to diagnose breast cancer. In this study, a method was presented for the extraction of suitable features in dynamic thermographic images of breast. The extracted features can help classify thermographic images as cancerous or healthy. Method: In this descriptive-analytical study, the images were taken from the IC/UF...

متن کامل

Edge Detection Based On Nearest Neighbor Linear Cellular Automata Rules and Fuzzy Rule Based System

 Edge Detection is an important task for sharpening the boundary of images to detect the region of interest. This paper applies a linear cellular automata rules and a Mamdani Fuzzy inference model for edge detection in both monochromatic and the RGB images. In the uniform cellular automata a transition matrix has been developed for edge detection. The Results have been compared to the ...

متن کامل

Extraction of Suitable Features for Breast Cancer Detection Using Dynamic Analysis of Thermographic Images

Introduction: Thermography is a non-invasive imaging technique that can be used to diagnose breast cancer. In this study, a method was presented for the extraction of suitable features in dynamic thermographic images of breast. The extracted features can help classify thermographic images as cancerous or healthy. Method: In this descriptive-analytical study, the images were taken from the IC/UF...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014